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Abstract 

Cooperativity plays an important role in the action of proteins 
bound to DNA. A simple, mechanical mechanism for cooperativity, in 
the form of a tension-mediated interaction between proteins bound to 
DNA at two different locations, is proposed. These proteins are not 
in direct physical contact. DNA segments intercalating bound pro- 
teins are modeled as a Worm-Like Chain, which is free to deform in 
two dimensions. The tension-controlled protein-protein interaction is 
the consequence of two effects produced by the protein binding. The 
first is the introduction of a bend in the host DNA and the second is 
the modification of the bending modulus of the DNA in the immediate 
vicinity of the bound protein. The interaction between two bound pro- 
teins may be either attractive or repulsive, depending on their relative 
orientation on the DNA. Applied tension controls both the strength 
and the range of protein-protein interactions in this model. Properties 
of the cooperative interaction are discussed, along with experimental 
implications. 
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The cooperative binding of proteins to DNA plays a significant role in the 
regulation of gene expression |Pwen-Hughes and Workman, 199~4}| since it al- 
lows a sensitive response to small changes in protein concentration. In par- 
ticular, it is well known that transcription factor proteins [[Lodish et al, 1995 
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exhibit a significant level of cooperativity [ |Sun et al, 1997)1 . The structural 
basis of the cooperativity is not fully understood |[Sun et al, 1997 



but it is 



known that long-range cooperativity is possible through loops [ [Schlief, 1992| | , 
formed as the result of association between two DNA-binding proteins. Loop- 
ing is also believed to play an important role in gene access control. Coop- 
erativity at shorter distances may be related to specific protein-protein in- 
teractions or to a generic cooperativity resulting from structural distortions 
induced by the binding of a protein to DNA [Lilley, 1995| , |Nelson, 1995|| . For 



example, the binding of transcription regulation proteins such as the impor- 
tant TATA-box promoters (TPB) involves amino-acid intercalation into the 
stack of base pairs |[Kim, et al, 1993| , [Werner, et al, 1996|| . The result is that 
kinks are produced — sharp local bending angles in the DNA strand. This de- 
formation may permit a better fit for other DNA-associating proteins, such 
as the polymerases. Disruptions of the base-pair stacking sequence have 
no effect beyond about half a turn of the double helix |[Kim, et al, 1993 , 
Werner, et al, 1996| , so it is expected that this form of cooperativity is re- 
stricted to the immediate neighborhood of the primary binding protein. 
Protein-induced deformations of the DNA strand are not restricted to tran- 
scription factors. DNA may wrap itself once or more around a protein as 
happens in the case of complexation of DNA with nucleosomes, the gyrase 
enzyme, or bacterial RNA. Interestingly, nucleosome-binding appears to be 



cooperative with transcription factors ||Owen-Hughes and Workman, 1994 



DNA-deforming proteins will be referred to below as "architectural" pro- 
teins. 

The aim of this article to demonstrate the possibility of a variable-range 
form of cooperative DNA binding of architectural proteins with a range and 
strength that is regulated by the tension along the DNA strand. The co- 
operative interaction between proteins that are not in physical contact, is 
mediated by the deformation of the intervening DNA strand. In the ab- 
sence of tension, the proposed mechanism is absent. Our demonstration of 
the possibility of tension-controlled cooperativity is based on an analysis of 
the "Worm-Like Chain" (WLC) model of DNA elasticity | |Hagerman, 1981| , 
Kam, et al, 198 If , which has been used in studies of single protein bind- 
ing to DNA [ Marko and Siggia, 1997 1- The WLC model is characterized 
by a single parameter, a length-scale £ p , known as the persistence length. 
It is the distance over which a (tensionless) WLC maintains orientational 
order in the presence of thermal fluctuations. That is to say, the autocor- 
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relation between orientational order at two different locations of the chain 
falls off with distance, £, as e~ £ ^ p . Fitting the results of micro-mechanical 
in-vitro studies of protein-free DNA chains to the predictions of the WLC 
model yields good results for a persistence length of about 50 nanometers 
[ [Bustamante, et al, 1994J , [Bensimon, et al, 1994|| , although longer persistence 
lengths have been reported by different methods fBednar, et al, 1995 . 



We employ the WLC model only to evaluate binding cooperativity due 
to deformations produced by architectural proteins in sections of the DNA 
strand that are not in the immediate neighborhood of the binding proteins 
themselves. The WLC model will not apply reliably when the two proteins 
are so close together that details of base-pair action (i.e. roll, slide and 
twist) play an important role in the mediation of their interaction. The 
primary binding of the protein itself is characterized by two phenomenolog- 
ical parameters: the single protein, zero-tension specific binding free energy 
E^(s) and the DNA bending angle a; see Figure [I]. The specific binding 
energy E^'(s) is a sequence-sensitive quantity that depends on the location 
of the protein along the chain. It is usually in the range of 10 to 30 ksT 
with ks Boltzmann's constant and T the temperature in degrees Kelvin. 
At room temperature ksT ~ 0.6 kcal/mole. By comparison, the individ- 
ual ionic bonds between base pairs in DNA are in the range 2-3 kcal/mole, 
and typical covalent bounds are the order of 60-120 kcal/mole. The index 
s refers to the position of the protein along the DNA strand. The protein- 
DNA complex will be assumed to be rigid for the tension levels envisioned, 
so the bending angle a is independent of tension. The values of E^'(s) and 
a must be obtained either experimentally or by detailed structural modelling 
of DNA-protein interactions. 

We will focus on the results of analytical and numerical studies that utilize 
the WLC model to compute the deformation energy of a DNA chain under 
tension with two identical architectural proteins attached and separated by 
a distance I. The two proteins are assumed to induce a bend into the DNA 
strand without any twisting. We find that there is, in general, both an 
enthalpic and an entropic contribution to the binding cooperativity proposed 



here |[Wang and Ciaever, 19881 - The enthalpic cooperative correction to the 



binding energy E^ 2 ' between the two proteins (denoted by 1 and 2) that 
are separated by a distance I = \si — s 2 \ assumed large compared to the 
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characteristic dimensions of the protein is given by 



i£L - E{ 1 \s 1 ) + 4 1} ( S2 ) - 2a 2 ^T (J^j (l =F e H— >l/«^) (1) 

This formula, derived in Appendix A, holds when the bending angle a is 
small compared to ir/2. The case of large bending angle is discussed later. 
The plus sign in Eq. || refers to the "symmetric" case with the two proteins 
bound on the same side of DNA while the minus sign refers to the "anti- 
symmetric" case with the two proteins bound on opposite sides of the DNA 
(see Figure [3]) • The chain-tension F in Eq. [I] is variable but assumed to be 
in the the range of 10~ 2 to 10 pN. The tension-dependent length-scale £(F) 
in Eq. |l] is of key importance; it sets the range of the binding cooperativity. 
This quantity is defined by: 



m = f-^§ L (2) 

For a one pN tension, £(F) is about 7 nm, while for a 10~ 2 pN force it is 
about 70 nm. However, Eq. [I] is only valid as long as the "tension-length" 
£(F) is less than the persistence length £ p . For larger tensions £(F) is small 
compared to the persistence length £ p m 50 nm. Note that all parameters in 
Eq. P) can, in principle, be determined experimentally. 

According to Eq. [I], there is no bending-induced cooperativity between 
two proteins separated by a distance large compared to the tension length 
£(F). In that limit, the only effect of tension is to reduce the single protein 
binding energy by an amount per protein A.E enth = a 2 JkBT^ p F (using Eq. 



|2| to eliminate £(F) in Eq. |l|). This tension- induced reduction of the protein 
binding energy is discussed |[Marko and Siggia, 1997|| for the case of single- 



protein binding to DNA. For a one pN tension, the binding energy reduction 
is significant: about lo^k^T . With increasing tension, the protein will be 
released from the DNA chain when IAE cn ^ starts to approach the protein 
binding free energy 

1 If the DNA-protein interaction involves a number of turns of the DNA around the 
protein, then we must add the quantity ALF to Ai? ent h, with AL the excess DNA length 
wound around the protein. For low tensions F, this correction is small compared to 
AE'cnth, but for a one pN tension, it is comparable. 
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A summary of expected values for the cooperative interaction between two 
bound proteins is displayed in the table 1. It is assumed that the bending 
angle that each enforces is 45°. Other quantities are appropriate to DNA at 
room temperature. 

When the spacing between the two proteins is reduced to within a dis- 
tance of order the tension-length £(F), then the binding energy increases 
exponentially for the antisymmetric arrangement while it decreases expo- 
nentially for the symmetric arrangement. We can interpret the last term in 
Eq. [I] as an effective potential energy of interaction, V ent h (I) , between two 
proteins given by: 



with I = \s\ — S2I the interprotein spacing. The minus sign in Eq. |3] is 
for the anti-symmetric case. In Appendix A we compute Knth(0 f° r values 
of the bending angle a that range up to n/2. The result is shown in Fig. 
0. Note that both the vertical and horizontal axes are dimensionless. The 
energy has been expressed in units of 2ksT (£ P /£(F)), the energy scale of the 
tension-induced binding energy reduction Ai^enth (see Eqs. [I] and |3]), while 
the distance is expressed in units of the tension length, £(F). It follows from 
Fig. ^ that an increase in tension reduces the range of the interaction, as 
expected from Eq. [|, while it increases the strength of the cooperativity. If 
the spacing I between the proteins is small compared to the tension length 
then the effective potential V(0) cancels the Ai? ent h term in Eq. [p. 
The enthalpic energy gain obtained by bringing two proteins together along 
the chain from a large separation is equal to twice AE enth . 

For the symmetric case, the effective potential energy is repulsive. The 
corresponding energy plots are shown in Figure [| The enthalpic deforma- 
tional energy now increases as the two proteins approach each other. In the 
limit of small bending angle a, two adjacent bending proteins with a in the 
symmetric conformation have the same tension-induced binding energy re- 
duction as a single protein with a double bending angle of 2a. The energy 
scale for the cooperativity is thus in general set by Ai^nth- The effective 
interaction potential is, then, always less than the single protein binding en- 
ergy since proteins are expected to unbind when Ai? ent h spacing approaches 




(3) 



E^(s). 
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Although it would appear as if the enthalpic cooperativity depends on 
temperature within the WLC model (see Eqs. [I] and |3|), this is not the case: 
£p is inversely proportional to ksT (see Appendix A) so neither AE enth nor 
£(F) depend on fc^T. There is, however, a purely entropic contribution to the 
cooperativity that is explicitly dependent on ksT. In Appendix B, we obtain 
the following expression for the entropic correction to the cooperativity: 



AE® = k B T 



d 




21 

1 - e 



(4) 



for two proteins of length d separated by a distance I = \s\ — S2I ■ The 
entropic contribution does not depend on the bending angle a and is the 
same for the symmetric and antisymmetric configurations. In the limit of 
protein separations that are large compared to the tension length £(F), the 
entropic contribution AE ent r = h^E^ (00) to the single-protein binding 
energy is: 

A ^ =fcBT {_^_ ln ( 1+ _y} (5) 

This is, again, a negative quantity: the local constraints imposed on the 
DNA chain by the two binding proteins lowers the entropy of the complex 
as compared to a free chain and hence reduces the binding energy. The 
difference can, again, be interpreted as an effective entropic potential energy 
of interaction, which is now entropic. It is given by 

(( -^—Y e~«^) \ 
1 - ^p 1 72 (6) 
4 ( 1 + ^y) J 

This effective entropic potential energy is always attractive. In Fig. [| we 
show the entropic potential energy for two proteins of size d equal to 20 
Angstroms and for a one pN applied tension (£ (F = lpN) = 70 A) . 

For larger bending angles, the entropic interaction is both weaker and 
shorter in range than the enthalpic interaction. However, since this interac- 
tion does not depend on the magnitude of the bending angle, it dominates 
for zero bending angles, or bending angles that are very small. An interest- 
ing special case concerns the entropic interaction between two long strings of 
binding proteins. If we model a polymerized string of proteins bound to DNA 
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as a rigid section of size d, with d assumed large compared to the tension 
length, and with zero total bending angle, then two such strings are expected 
to have an effective entropic interaction potential given by:[] 

V entT (l) fc B Tln(l-e-5&) 

d/£(F) -> oo (7) 

The free energy of two long rigid strings separated by a small gap is lowered by 
an amount of order ksT if the intervening gap is filled in either by shifting one 
of the two strings to close the gap or by adding additional binding proteins 
inside the gap. This effect provide us with a curious entropic stabilization of 
mechanism of polymerization of proteins along DNA strands. 

When we increase the bending angle beyond tt/2, new physical effects 
appear. As shown in Figures ^ and ^, there are in general two possible 
configurations for a symmetric two protein/DNA complex. Up to now, it 
has been tacitly assumed that the "S" or "stretched" configuration was the 
appropriate one (as shown in Fig.[l]) and indeed the S configuration has the 
lower free energy for bending angles a less than tt/2. However, when the 
bending angle exceeds tt/2 this is no longer the case. For low tensions, the 
"L" , or "looped" configuration has in fact the lower elastic free energy while 
for higher tensions, the S configuration is more stable. The two regimes 
are separated by a mathematical singularity which has the character of a 
first-order phase transition. In Fig. [7| we show the extension, X, of the two- 
kink configuration for which the kink angle is greater than tt/2, as a function 
of tension, F. There is a transition from the L to the S configuration with 
increasing tension visible as a discontinuity of the extension at the transition 
point. There is no transition in the antisymmetric case. 

In summary, we have shown that, within the confines of the WLC model, 
tension can trigger cooperative binding for anti-symmetrically arranged ar- 
chitectural proteins. The binding strength has both an enthalpic and an 
entropic contribution, and it has an appreciable magnitude for tensions of 
the order of one pN or higher. For larger bending angles, we find two compet- 
ing configurations connected by a tension-induced phase-transition. Both the 
enthalpic and entropic contributions to cooperativity vanish in the limit of 

2 Formally, there is a divergence in Eq. [j] for small spacings /, but Eq. |?| should, of 
course, not be expected to retain validity when I approaches a base-pair spacing. 
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zero tension (see Eqs. |I] and ^|). This last result is certainly not self-evident. 
The decrease in chain entropy imposed by two rigid sections can, for instance, 
be expected to depend on the spacing, even for zero tension. Interestingly, 
a study of the interaction between two stiff inclusions inside a two dimen- 
sional surface (such as a membrane) reports Pruinsma, et al, 1994|| that in 
this case there is a long-range zero-tension interaction with both entropic 
and enthalpic contributions, both dropping off as the inverse fourth power 
of the spacing between the inclusions. The disappearance of tension-induced 
cooperativity at F = thus must be related to the fact that we are dealing 
with a one- dimensional geometry. 

Experimental in vitro tests of the proposed mechanism can be performed 
by preparing a bundle of DNA strands, each strand containing bacterial gene 
operator sequences periodically spaced by a distance of / base pairs. The 
associated repressor protein (such as the lac repressor) binding specifically to 
the operator sites would induce local kinks at the operator sites. According 
to the model, the logarithm of the equilibrium repressor-operator binding 
constant Kro contains a contribution that depends on the operator spacing 
I and the tension F of the DNA bundle according to Eq. |l|. 

A study of the kinetics of cooperative protein-DNA association can also 
be a testing ground. For instance, since the TATA box binding protein TBP 
is known to produce a large bending angle, the one- dimensional diffusion 
along the DNA of other proteins required for the RNA polymerase initiation 
complex, such as TFIIE, H and J, that are non-specifically bound to DNA 
will be speeded up by bending-induced cooperativity. The reason is that the 
effective potential AV(l) turns the random one-dimensional diffusion into a 
directed process. It should be noted here that the weaker non-specific bind- 
ing of DNA associating proteins will not deform the DNA strand as much 
as specific binding. However, studies of the dependence of non-specifically 
bound 434 repressors on the DNA flexibility indicate that non-specific bond- 
ing also involves distortions of the local DNA structure, so there still ought 
to be a tension-controlled interaction between specific and non-specifically 
bound proteins ||Hogan and Austin, 1987] . We thus predict that (modest) 



tension will actually increase the formation rate of the RNA polymerase ini- 
tiation complex. At high tension levels, the formation rate will decrease with 
tension for reasons discussed earlier. 

Another possible area where the present theory could be applied is histone- 
DNA interactions. According to our model calculations, a collection of 
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non-specifically bound proteins ought to adopt an antisymmetric zig-zag 
configuration under tension. The binding of histones to DNA is reported 
to produce a zig-zag nucleosome structure consistent with our calculations 
[Thoma, et al, 1979"! . Under tension, the zig-zag structure should thus be 
stabilized. It should be kept in mind, though, that the intervening linker 
histones may well affect the competition between different configurations 



Thoma, et al, 197 9 



An important question for the relevance of the work presented here is 
whether DNA is under tension under in-vivo conditions. DNA strands in 
suspended in good solvent are in fact not under tension. We believe that this 
is an exceptional case. A DNA strand whose ends are fixed is, even for low ex- 
tensions, typically under a tension in the range of 0.01 pN due to thermal fluc- 
tuations, as shown by the micromechanical studies Pustamante, et al, 1994 



Bcnsimon, et al, 1994|| . For extensions closer to one, the tension can be 
much larger. If a DNA strand is subject to the activity of force-transducing 
proteins — for instance during mitosis, RNA transcription, or homologous 
recombination — then much higher tensions can be generated. It is known 
that a single motor protein is able to generate a force of 10 pN or more 



|Yin, et al, 1995 1. Finally, architectural proteins themselves generate tension 



if they attach to a DNA strand with fixed ends. When an architectural pro- 
tein attaches, DNA material is required to accomodate the deformation of 
the DNA chain near the protein. A simple calculation shows that the self- 
induced tension of a DNA strand of length L whose ends are held fixed a 
distance X apart and which contains a line density p of bending proteins is 
given by: 



( 



with (a 2 ) the average of the square of the bending angle. If the line density is 
of the order of one protein per hundred Angstrom and if (a 2 ) is of the order 
one, then this self-generated tension is of the order one pN (it should be 
possible to verify Eq. |8| in micro-mechanical measurements). It thus seems 
reasonable to assume that DNA is under tension for in-vivo conditions. 

Our results were derived with DNA-protein binding in mind, but the gen- 
eral aspects of our conclusions are related to work in other areas. We can con- 
sider the predicted aggregation of proteins under tension as a form of stress- 
induced decomposition. Stress-induced phase-separation of multi-component 
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systems is actually a classic phenomenon in solid-state materials for the case 
that different constituents have different elastic moduli fCahn, 196 1|| , just as 
envisioned in the present case. 

We conclude by noting that a number of technical objections can be raised 
against the method used in our study It is not really reasonable to assume 
that DNA twist plays no role, in view of the helical nature of DNA, and that 
we can be allowed to restrict ourselves to a purely two-dimensional arrange- 
ment. However, it is our current belief that the introduction of the twist 
degree of freedom introduces qualitative, but not quantitative, modifications 
to the results reported here. The assumption of internal structural rigidity 
of proteins also is questionable. For instance, it is well known that many 
enzymes can undergo stress-induced structural changes. Enzymes bound to 
DNA also may change their structure under stress. Finally, it must be kept 
in mind that the WLC model neglects the possible influences of intervening 
structures, such as other bound proteins, on the tension-induced interaction 
betwen two bound proteins, and that looping and other manifestations of 
DNA self-interaction are ignored. Despite all these caveats, we feel that the 
basic result — stress-induced aggregation of DNA-bound proteins — is robust 
and should remain present in more realistic models. 
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Appendix A 

We model the DNA strand as a rod that is free to move in two dimensions. 
The shape of the rod is parameterized in terms of an angular variable 9, 
which will vary with distance, s, along the rod. This parameterization is 
pictured in Figure || The energy of a configuration of the system is given by 
the following expression. 




(A.l) 
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The rigidity against bending is parameterized in terms of a stiffness param- 
eter K, while F is the tension applied to the rod. The quantities referred to 



in the text are related to K and F by £ (F) = JK/F and £ p = K/k B T. In 
the "classical" limit, which applies at very low temperatures, is determined 
by the extremum equation 

<)H -K^- + Fsm9(s) = (A.2) 



56 (s) ds 2 

This equation is solved by quadratures by noting that it is identical to the 
first order differential equation 

„\ \ 2 

= k (A.3) 
with k a constant. From Eq. |A.3| we immediately obtain 

' W ±ds (A.4) 




f (A -cost 



with A another constant. Integration of [A.4| yields an implicit form for 9(s), 
in which s is represented as a function of 9. The specific function is a combi- 
nation of elliptic integrals. There is no evident way to invert the expression 
thus derived to obtain 9 as an explicit function of s. 

On the other hand, when the bending of the DNA is small, so that 9 is 



small, the cosine function in [A.l| can be expanded. The zeroth order term is 
a constant, "background" energy. The energy of the system, as it depends 
on the configuration of the DNA, is given by 



H[9(s)}= ds\-{^\ +-9(s) 2 \ (A.5) 




The statistical mechanics of the system controlled by this energy can be 
analyzed in complete detail. Here, the extremum equation is 

-K^-P^- + F9(s) = (A.6) 
ds z 

the solution of which is 

9 (s) = Ae s/ ^ F) + Be~ sl ^ F) (A.7) 
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Where f (F) = VKF = yj£ p k B TF, and A and B are constants. 

As an example of the use of the small angle formulas A. 5 - A. 71 we calculate 
the free energy cost of the presence of two proteins that kink the DNA to 
which they are attached. These insertions enforce a discontinuity in d9/ds at 
the location of each kink. We assume that the magnitude of the discontinuity 
at the first kink and a + at the second. To simplify the analysis, we 



is a. 



place the first kink at s = —1/2 and the second kink at s = 1/2. The 
calculation of the classical solution subject to these constraints is relatively 
straightforward. One finds 



e(s) 



sinh 



sinh 



6>e 



S < -H 



sinh -v / (s- 



sinh i 



'£-1 
K 



-5< s <! 



< s 



(A.8) 

According to |A.8j , the angle is equal to 9 < immediately to the right of s = 
—1/2 and 9- immediately to the left of s — — 1/2. By the same token 9(1/2 — 
e) = 9 + and 9(1/2 + e) = 9 > . The kinks give rise to a discontinuity at 
s = ±1/2, which means that 9 < ^ 6L and 9 > ^ 9 + . We write 



9. 



— a. 



6+ + a_ 



(A.9) 



Then, the total energy, E, of the bent rod, as given by Eqs. [A.5| , |A.8| and 
A.9| , is given by 



E = -VFK 
2 



a_) 2 + (0+ + a + f + 9% coth J + 9 2 _ coth J + 2 ^ + ^_ 

V V sinh J j^l 

(A. 10) 



If we replace 9 + by E + A and by E — A, then Eq. [A.10| becomes 
£(E,A) = 



v^E 2 



eV K +1 _|_ A^2 eV g -1 
sinh \ sinh \J~^l 



+ E (ot + — ot_) + A (a+ + a_) + + a 2 



(A.11) 
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The partition function, Z, of the system is given by 



Z = J J dEdAexp (— /3£7 (E, A)) 



(A.12) 



where (3 = l/k B T. This double gaussian integral is evaluated by completing 
squares. Taking the log and multiplying by k B T to obtain the free energy, 
F, we find 



Vfk r 

4 I 



2 i 2 



2a+a_e 



In 1 



(A.13) 



The first term on the right hand side of the expression for the free energy 
in Eq. |A.13j is the mean field approximation to the free energy cost of two 
kinks in the rod. The second term is a partial contribution to the free energy 
due to fluctuations, in particular fluctuations in the angles 6 + and To 
complete the evaluation of the free energy associated with a pair of kinks, 
we must now average over all other fluctuations in the rod. To do this, we 
expand 9{s) in normal sinousoidal modes, subject to the condition that those 
modes do not alter the values of 9 immediately to the right or the left of the 
locations of the kinks. The contribution to the free energy that depends on 
the distance, /, between the kinks arises from the fluctuations in that region. 
This fluctuation sum reduces to 



k B T X> 



,n=l 



mr\ 2 F 
TJ + K 




k B T 



In 1 



(A.14) 



In arriving at the expression on the right hand side of Eq. |A.14| , an infinite 
contribution to the sum over logarithms was subtracted off. This "regular- 
ization" of the sum is consistent with standard field theoretical approaches 
to the evaluation of the free energy of systems such as the one under con- 



sideration here. The right hand side of |A.14| exactly cancels the fluctuation 



contribution to A7L5 . Thus, the free energy of the two-kink system consists 
entirely of the mean-field contribution. 



Appendix B 

In the limit that the kinks occupy an infinitesimal portion of the DNA, the 
free energy cost of a pair of them is, as noted in the appendix above, given 
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entirely by mean-field theory. There is a mechanism leading to fluctuation 
induced interaction energy, and that is the modification of the bending mod- 
ulus by the proteins that cause the kinks. This modification can be modeled 
as follows. In the small-angle approximation, one replaces the energy expres- 
sion in [A.5| by 

Hvw-J*{^(*®f + frr} (b.d 

where the symbol K\ 2 stands for the two possible values, K\ and K 2 , that 
the bending modulus can now take on in the various regions. The subscript 
2 applies in the regions that are in the immediate vicinity of the proteins, 
while the subscript 1 is appropriate everywhere else. This means that K\ is 
the unsubscripted K in previous expressions. The effects of modifications of 
the bending modulus can be calculated separately from the consequences of 
the bending of the DNA by proteins. This means that we can ignore any 
discontinuities or alterations of the equilibrium value angle 8(s) that result 
from the presence of proteins. 

The regions in which the bending modulus takes on its two possible values 
are indicated in Figure §. The bending modulus is equal to K 2 in the two 
heavily drawn regions, while it is equal to K\ everywhere else. Matching 
conditions at the boundary between regions are that 8(s) and Kid6/ds are 
continuous. 

In order to evaluate the contribution of fluctuations, it is necessary to 
determine the eigenf unctions of the Hamiltonian [B.l| . One searches for solu- 
tions to the equation 

d 2 _9_ 
~d~s 2 

There will be two sorts of solution: even and odd. 



- #1,2-7^ + F9 (s) = X9 (s) (B.2) 



Even solutions 

These solutions have the form 

coshes < s < 1/2 

6(s) = { A cosh q 2 (s - 1/2) + B sinh q 2 (s — 1/2) 1/2 < s <l/2 + d 

C coshes - 1/2 - d) + D sinh qi (s - 1/2 - d) l/2 + d<s 

(B.3) 
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where 9 e (—s) = 9 e (s). 
Here 

and 



(F — A)/iTi 



Q2 



(F-\)/K 2 = qi 




Making use of the boundary conditions, we find 
A = coshgiZ/2 



B 
C 
D 



\ — sinh qil/2 
V K 2 

\~K~ 

cosh giZ/2 coshg 2 <i + W — sinhgiZ/2 sinhg 2 <i 
V *V2 




— cosh q±l/2 sinh g 2 <i + sinh q±l/2 cosh g 2 <i 



(B.4) 
(B.5) 

(B.6) 
(B.7) 

(B.8) 

(B.9) 



Odd solutions 

Here, the solutions are 

{sinhgis < s < 1/2 

Acoshq 2 (s - 1/2) + Bsinh.q 2 {s -1/2) 1/2 < s < 1/2 + d 

Ccoshgifs - 1/2 - d) + Dsinh q 1 (s - 1/2 - d) l/2 + d<s 

(B.10) 

where o (—s) = —9 (s), and, in this case, the coefficients are given by 



.4 
B 

C 

D 



sinh q\l/2 




cosh q\l/2 



sinhgxZ/2 coshg 2 <i + J — coshgiZ/2 sinhg 2 <i 

K 2 



sinh q\l/2 sinh q 2 d + cosh q\l/2 coshg 2 <i 




(B.H) 
(B.12) 

(B.13) 

(B.14) 



The ultimate goal of the calculation is the so-called Fredholm determi- 
nant of the differential operator on the right hand side of Eq. [B.2|. This 
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determinant is readily evaluated with the use of a by-now well-known trick 
Coleman, 1985|| . The trick yields the following expression for the contribu- 
tion to the free energy of fluctuations in 9(s). 

k B T 



A 



ln(0 e (L) 9 (L)) 



(B.15) 



where the arguments of the logarithm are the even and odd solutions dis- 
played in |B.3| and [HTTP , where A has been set equal to 0. There are, in 
addition, normalization factors. The factor that multiplies the even solu- 
tion sets the magnitude of that solution equal to one at s = 0. The factor 
multiplying the odd solution sets the slope of that solution equal to unity 
at the origin. The argument L is the total length of the segment to which 
the proteins are attached. In the following development, the limit L — > oo 
is taken, and the contribution to the free energy relevant to the interaction 
between the two attached proteins is extracted. 

In the limit of asymptotically large L the two factors in the argument of 
the logarithm in Eq. B.15| are 



-qid 



cosh q 2 d + sinh q 2 d 




and 



e qid < cosh q 2 d + sinh q 2 d 




[BA6) 



(B.17) 



After some algebra we find for the dependence of the free energy of the 
system on the parameters /, d, and the K^s 

k B T 



where 



/ 



2(q 2 -qi)d 



A 



+ 



ln(/) 



(B.18) 



-2 qi d ginh 2q2 d (yKl - s/K^ 



VKjr 2 



^ e -2 qi d ginh 2 ^ _ K ^ 



-2qil 



Kl K 2 r - j (B-") 

Note that the dependence of the argument of the logarithm on the separa- 
tion, /, between the two regions of differing stiffness constant is as A— Be~ 2qi1 . 
This implies an always attractive interaction. 
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Table 1 

Enthalpic energy of interaction 



F(pN) 




Knth(/ = o A) 


Knth(/ = 50 A) 




£ p = 500 A 


in kcal/mole 


in kcal/mole 


0.1 


221 A 


0.82 


0.66 


1.0 


70 A 


2.6 


1.27 


10 


22 A 


8.2 


0.85 



Table of the enthalpic interaction strength, as given by the last term on the 
right hand side of Eq. (|l|) , for physically reasonable values of the temperature, 
the persistence length £ p , and the tension, F. For the dependence of the 
binding cooperativity range, £(F) on tension and other parameters, see Eq. 
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Figure 1: Strand of DNA containing two proteins. The proteins induce 
"kinking" on the DNA strand. Illustrated are the case of antisymmetric and 
symmetric configurations of the two bound proteins. The distance between 
proteins is expressed in terms of the locations, s± and S2, of each of them on 
the strand. 

Figure 2: The attractive interaction for an antisymmetrically oriented pair 
of bound proteins. Plotted is the total energy of the configuration in units of 
ksT, multiplied by the ratio £(-F)/£ p , where £(F) is the tension-dependent 
length scale, defined in Eq. ||] and £ p is the persistence length of the DNA 
strand. Curves are displayed for various values of the "kink angle", a, as 
shown in Figure [1] 

Figure 3: The repulsive interaction between two proteins bound in a sym- 
metric configuration. Plotted is the total energy of the configuration in units 
of ksT, multiplied by the ratio where £(F) is the tension-dependent 

length scale, defined in Eq. |2] and £ p is the persistence length of the DNA 
strand. Curves are displayed for various values of the "kink angle", a, as 
shown in Figure [1] 

Figure 4: The entropic interaction potential between two identical bound 
proteins resulting from the effect of each of them on the bending modulus 
of the strand of DNA to which they are attached. The interaction is plotted 
in units of ksT under the assumption that the region of affected DNA is 70 
Along, and that the applied tension is 1 pN. 

Figure 5: The "loop" or L configuration of two symmetrically bound proteins, 
when the bend angle enforced by a bound protein is greater than it/ 2. This 
is the preferred configuration when the applied tension is small. 

Figure 6: The stretched, or S configuration of two symmetrically bound pro- 
teins, when the bend angle exceeds ir/2. This is the preferred configuration 
at high levels of applied tension. 



20 



Figure 7: The overall extension of a very long strand of DNA containing two 
identical symmetrically bound proteins, the bend angle, a, of each of which is 
equal to 2.1. The extension, X, is relative to the fully extended DNA strand, 
measured in units of (2K/F) 1 ^ 2 , where F is the applied tension and K is the 
bending modulus. The vertical axis is the applied tension in units of K/21 2 
where L is the distance along the DNA backbone between proteins along the 
DNA strand. See Appendix A for a full discussion of the parameters utilized. 



Figure 8: The parameters in Eq. |A.1| . The quantity 6(s) is the angle between 



the flexible strand and the horizontal axis, while s is arclength. 



Figure 9: The regions of varying stiffness in a strand containing two bound 
proteins that alter the bending modulus where they attach to a strand of 
DNA. The values that the bending modulus, K, takes in the various regions 
are indicated. See Eq. [B.l| and surrounding text in Appendix B. 
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